Ensemble of Linear Models for Predicting Drug Properties

نویسندگان

  • Tomasz Arodz
  • David A. Yuen
  • Arkadiusz Z. Dudek
چکیده

We propose a new classification method for the prediction of drug properties, called random feature subset boosting for linear discriminant analysis (LDA). The main novelty of this method is the ability to overcome the problems with constructing ensembles of linear discriminant models based on generalized eigenvectors of covariance matrices. Such linear models are popular in building classification-based structure-activity relationships. The introduction of ensembles of LDA models allows for an analysis of more complex problems than by using single LDA, for example, those involving multiple mechanisms of action. Using four data sets, we show experimentally that the method is competitive with other recently studied chemoinformatic methods, including support vector machines and models based on decision trees. We present an easy scheme for interpreting the model despite its apparent sophistication. We also outline theoretical evidence as to why, contrary to the conventional AdaBoost ensemble algorithm, this method is able to increase the accuracy of LDA models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting distribution of Eurasian Lynx (Lynx lynx) using an ensemble modeling approach: A Case Study: Saveh Zarandieh Kharaghan Area, Markazi Province

Adequate knowledge about suitable habitats for wildlife is essential to prevent habitat destruction and extinction of species and for their conservation and management. The Eurasian lynx is one of the mostly distributed cats in Asia. In this study, we applied an ensemble habitat suitability modeling approach, using ten predictor variables to model Eurasian Lynx’s habitat suitability in Saveh Za...

متن کامل

Predicting Protein Binding of Drugs Using Abraham Parameters: Effect of Ionization

Background and purpose: Protein binding (PB) is an important pharmacokinetic parameter in drug discovery and development. In past years Abraham parameters were used to predict some physicochemical and pharmacokinetic properties of drugs. But in these cases, the ionization of drugs in blood pH (7.4) was ignored. Recently, Abraham parameters of chemical compounds in ionized form are proposed. Als...

متن کامل

QSPR models to predict thermodynamic properties of some mono and polycyclic aromatic hydrocarbons (PAHs) using GA-MLR

Quantitative Structure-Property Relationship (QSPR) models for modeling and predicting thermodynamic properties such as the enthalpy of vaporization at standard condition (ΔH˚vap kJ mol-1) and normal temperature of boiling points (T˚bp K) of 57 mono and Polycyclic Aromatic Hydrocarbons (PAHs) have been investigated. The PAHs were randomly separated into 2 groups: training and test sets. A set o...

متن کامل

Current Models for Predicting Drug-induced Cholestasis: The Role of Hepatobiliary Transport System

Drug-induced cholestasis is the main type of liver disorder accompanied by high morbidity and mortality. Evidence for the role of hepatobiliary pumps in the cholestasis patho-mechanism is constantly increasing. Recognition of the interactions of chemical agents with these transporters at the initial phases of drug discovery can help develop new drug candidates with low cholestasis potential. Th...

متن کامل

Current Models for Predicting Drug-induced Cholestasis: The Role of Hepatobiliary Transport System

Drug-induced cholestasis is the main type of liver disorder accompanied by high morbidity and mortality. Evidence for the role of hepatobiliary pumps in the cholestasis patho-mechanism is constantly increasing. Recognition of the interactions of chemical agents with these transporters at the initial phases of drug discovery can help develop new drug candidates with low cholestasis potential. Th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and modeling

دوره 46 1  شماره 

صفحات  -

تاریخ انتشار 2006